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ickground of Invention 

The present invention relates generally to the field of electronic imaging, and 
more particularly, to a method and apparatus for reconstructing images of documents 
produced using color filter array sensors. 

Most electronic color cameras employ a single sensor on which a periodic pattern 
of dyes of different colors is superimposed. This pattern is known as a color filter 
array (CFA). Many other color-sampling schemes are possible, such as prisms plus 
multiple-sensors, random color arrays (as in the human eye), or sampling different 
colors at different times. However the application of these schemes in consumer 
digital photography is limited by factors such as the cost of accurate mechanical 
alignment, fabrication cost, processing complexity, and the degree of motion present 
in real scenes. 

A CFA sensor is used to simultaneously capture three sets of image data in three 
primary color planes or channels, such as red (R), green (G) and blue (B). A basic CFA 
sensor is a single two-dimensional array of sensors that are covered by a pattern of 
red, green and blue filters, such as the Bayer pattern primary CFA illustrated in Figure 
1 . The output of a CFA sensor which records only a single color channel at each pixel 
or senor location is defined herein as a CFA image. 

Image processing is used to combine the three primary color planes to create a 
full-color image from a CFA image. Although the present invention is described while 
referring to the Bayer pattern CFA, those skilled in the art will appreciate that images 
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obtained from other CFA patterns can be reconstructed using the method and 
apparatus described herein. An example of another CFA pattern is the Hitachi 
complementary color (i.e., cyan (C), magenta (M), yellow (Y), and green (G)) CFA, which 
is illustrated in Figure 2. 

[0005] As illustrated in Figure 3, an approximate full-color full-resolution image stored 
in memory 306 is generally obtained today using CFA sensors forming part of an 
image recording module 302 by interpolating the values (e.g., red, green and blue) of 
the colors that were not sampled at any given pixel in the sensor in interpolator 304. 
For a Bayer pattern, two forms of interpolation are required: one for synthesizing 
missing green information, at sites where blue or red are sampled (i.e., luminance 
reconstruction), and the other for synthesizing missing red/blue information at sites 
where green or blue/red were sampled (i.e., chrominance reconstruction). 

[0006] Since CFA sensors like the Bayer pattern sample chrominance less frequently than 
B luminance, there is more risk of generating aliasing artifacts during chrominance 

u reconstruction. Consequently, CFA signal processing has been improved over time to 

correct such artifacts created during chrominance reconstruction. For more 
background on CFA imaging and reconstruction see for example the following U.S. 
3; Patent Nos., which are incorporated herein by reference: 3,971 ,065; 5,382,976; 

5,506,619; 5,629,734. 



Pi 



[0007] Signal processing techniques have also been developed for luminance 

reconstruction. Three common methods for luminance reconstruction are: bilinear 
interpolation, which takes the average of four surrounding green pixels; low-pass 
filtering, which finds the convolution of the greens with a finite window approximation 
of a sine function at forty five degrees to the rows and columns of the sensor; and 
median interpolation, which takes the average of the two middle-ranking surrounding 
green pixels. 



[0008] 



Because these known methods of luminance reconstruction introduce artifacts, 
notably in the vicinity of edges, due to the indiscriminate nature of the averaging that 
they apply, decision directed interpolation methods have been devised that select an 
interpolator on the basis of locally detected patterns. Examples of the features that 
have been employed for deciding which is the appropriate interpolator include: the 
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local-average-thresholded image, and the local gradient and Laplacian values. For 
additional background on methods aimed at reducing chrominance artifacts see for 
example the following U.S. Patent Nos., which are incorporated herein by reference: 
4,642,678; 4,716,455; 4,716,455; 4,724,395; 4,803,548; 5,629,734; 5,373,322; 
5,778,106. 

[0009] Since existing luminance reconstruction methods employ spatial interpolation to 
reconstruct green channel information, it is inevitable that the effective resolution of 
CFA reconstructed images containing many small edges, like text in document 
images, will be lower than images from an equivalently sized gray-scale sensor. This 
results in greatly reduced legibility of document images for both machines and 
humans. Accordingly, it would be desirable to provide an electronic camera imaging 
system that reconstructs missing green channel values in CFA images using other 
^ than spatial interpolation to avoid such resolution-loss. 

m Summary of Invention 

[001 0] In accordance with the invention, chrominance samples of a color filter array (CFA) 

O 

fg sensor are used to reconstruct a higher-resolution luminance channel by making 

L, assumptions about the relation between different color channels. Advantageously, 

{=* reconstructed CFA images recording document images have improved legibility 

™ because high spatial frequency information about character shapes in document 

D images is retained by the reconstruction method of the present invention. 

[001 1] In accordance with one aspect of the invention there is provided an apparatus, and 
method therefor, for reconstructing CFA images. An image-recording module 
generates an image of sampled values output from a color filter array. Each pixel in 
the image has a location at which a sampled value of one of a plurality of color 
channels is recorded. An image reconstruction module linearly transforms the 
sampled value at the location of a selected pixel in the image to estimate a color 
channel not recorded by the image recording module at the location of the selected 
pixel in the image. That is assuming a lens free of chromatic aberration, the linear 
transformation is used to estimate missing color channels at the selected pixel 
location using the sampled value at that location. 
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[001 2] In accordance with another aspect of the invention, coefficients of the linear 
transformation are estimated in a first embodiment by computing statistics that 
depend on the sampled value but not the position of the samples of at least two color 
channels within a window of the image. According to this embodiment, the 
coefficients can be computed by, for example, computing an average value over a 
window of the image by equalizing mean and variance of each of the color channels of 
the window. 

[001 3] In accordance with yet another aspect of the invention, coefficients of the linear 
transformation are estimated in a second embodiment by fitting a line using 
regression to at least one set of sums of color channels along lines through a window 
of the image. For example, sets of sums of the red color channel of the Bayer pattern 
are determined by computing a regression using pairs of red and green sums in red 
rows. 



m Brief Description of Drawings 



H= [001 4] These and other aspects of the invention will become apparent from the following 

O 

m description read in conjunction with the accompanying drawings wherein the same 

L reference numerals have been applied to like parts and in which: 

O 
M= 

u, [001 5] Figure 1 illustrates the pattern of the primary CFA for a Bayer pattern; 



■U 



3 [001 6] Figure 2 illustrates the pattern of the CFA for the Hitachi complementary color 
pattern; 

[001 7] Figure 3 illustrates a block diagram of a CFA electronic camera imaging system 
with an image interpolator as is known in the art; 

[001 8] Figure 4 illustrates a block diagram of an electronic camera imaging system 
incorporating the present invention; 

[001 9] Figure 5 illustrates a flow diagram that sets forth the steps performed by the 
image reconstruction module shown in Figure 4 to produce a reconstructed image 
from a CFA image; 

[0020] 

Figure 6 illustrates a document image defined using green and red color channels 
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that forms the characters "text" therein; 

[0021] Figure 7 illustrates a section of the document image shown in Figure 6; 

[0022] Figure 8 illustrates a graph of the green color channel and the red color channel 
as they change along the column-axis of the section of the document image shown in 
Figure 7; 

[0023] Figure 9 illustrates a graph that plots values of the green color channel against the 
values of red color channel at a corresponding column location as shown in the graph 
in Figure 8; 

[0024] Figure 1 0 is a flow diagram of an alternate embodiment of the flow diagram 

shown in Figure 5 in which the step 504 in Figure 5 is substituted for the steps 510, 
Q 512, 514, 516, and 518 shown in Figure 10; 

jfj [0025] Figure 1 1 illustrates an image of the letter "a"; 

y& [0026] Figure 1 2 illustrates a plot of color sums computed along rows of the image 
rr shown in Figure 1 1 ; 

ULi 
s 

□ [0027] Figure 1 3 illustrates a plot of color sums computed along columns of the image 

fT shown in Figure 1 1 ; and 

p [0028] Figure 14 illustrates the regression of a line that estimates coefficients of the red 

^ channel using the sums shown in Figures 12 and 13. 

Detailed Description 

[0029] /. Overview 



[0030] 



Referring now to the drawings, Figure 4 illustrates a block diagram of an 
electronic camera imaging system 400 incorporating the present invention. The 
system 400 includes in part an image recording module 302 that outputs a CFA image 
to an image reconstruction module 404. It will be appreciated that in alternate 
embodiments additional processing may be performed before the CFA image is output 
to the image reconstruction module 404. For example, the CFA image may be 
compressed, transmitted and decompressed prior to reaching the image 
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reconstruction module 404. The reconstruction module 404 estimates missing color 
channel values by linearly transforming sampled color channel values in the CFA 
image. The output of the image reconstruction module 404 is stored in memory 406 
as a full-color or gray-scale full-resolution reconstructed image. It will be appreciated 
that in alternate embodiments additional processing may be performed before the 
image is stored in the memory 406. For example, the image may be either 
compressed or sub-region sampled prior to being stored in memory 406. 

[0031] As known in the art, the image recording module 302 includes in part an image 
sensor for recording an image directed through conventional optics. The image filter 
is covered by a CFA, such as the Bayer filter illustrated in Figure 1 . When the image 
sensor is exposed to light, each pixel value in the image sensor records either a red, 
green, or blue color value depending on the color filter applied to that pixel location. 
J These color values are converted using a conventional analog to digital converter to 

Cm generate a CFA image, which is input to the image reconstruction module 404. 



[0032] A processor (not shown) with operating memory (not shown) controls the 

operation of the system 400. The processor receives input signals from a user to 
record an image by engaging the image recording module. Once part or all of the CFA 
image is produced, the processor engages the image reconstruction module 404. The 
reconstructed image is stored in memory 406 for later display or transfer to another 
device. 

[0033] Figure 5 illustrates a flow diagram that sets forth the steps performed by the 
image reconstruction module 404 to produce a reconstructed image from a CFA 
image. Initially at step 500, the CFA image is acquired from the image recording 
module 302. Subsequently at step 502, the CFA image is divided into windows having 
a predefined size "W M . In one embodiment, the window size is 32x32 pixels that 
overlap neighboring windows on each side by 16 pixels. At step 504, coefficients of a 
linear transformation are estimated for each of the windows of the CFA image 
identified at step 502. 

[0034] At step 5Q5 ( t h e |j near transformation is applied to each pixel of the windows in 
the CFA image defined at step 502 using the coefficients estimated at step 504 to 
compute samples of missing color channels in the original document image. In the 
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embodiment specified above with a 32x32 window size, each window overlaps 
neighboring windows by 1 6 pixels so that the linear transformation is applied to only 
the center 1 6 pixels of fully overlapped pixels windows at step 506, At step 508, a 
full-color or gray-scale image is reconstructed by combining the observed CFA 
samples with the samples estimated at step 506. 

[003 5] 2. Intuition For Using A Linear Transformation 

[0036] The linear transformation specified at step 506 shown in Figure 5 is defined for 
two color channels. It is assumed herein that within a small enough region on the 
surface of the object being imaged (for example a document with text), there exist 
combinations of materials with only two distinct diffuse reflectance functions (for 
example white paper and a combination of color toners) and a constant illuminant 
color. When these reflectances are multiplied by the illuminant, two colors, namely the 
foreground and background colors are formed. It is further assumed that the imaging 
process consists of a blurring and sampling process in which the blurring is linear and 
independent of color. Advantageously, the present invention exploits this local two- 
color property to retain the resolution of a gray-scale sensor while using a CFA color 
sensor. Rather than making spatial assumptions and interpolating missing CFA 
sampled color values, the linear transformation specified at step 506 is used to 
linearly transform sampled color values to missing color values by making color 
coherence assumptions. 

[0037] The intuition for defining a linear relationship between a foreground and a 

background color is illustrated in Figures 6-9. Figure 6 illustrates a document image 
600 with the characters "text" set forth therein, which is defined using green and red 
color channels. Figure 7 illustrates the section 602 of the document image 600 shown 
in Figure 6. Figure 8 illustrates a graph of the green color channel 604 and the red 
color channel 606 as they change along the column axis of the section 602 of the 
document image 600 shown in Figure 7. Figure 9 illustrates a graph that plots values 
of the green color channel against the values of red color channel at a corresponding 
column location as shown in the graph in Figure 8. For example, the green color 
channel value 604 and corresponding red color channel value 606 on the graph shown 
in Figure 8 specify sample point 608 on the graph shown in Figure 9. 
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[0038] As shown in Figure 9, there exists a linear relationship between the colors forming 
the section or portion 602 of the document image 600 shown in Figures 6 and 7. 
Exploiting this linear relationship permits luminance values to be estimated using a 
linear transformation of sampled chrominance values in a CFA image. 

[0039] 3. Model For Estimating The Linear Transformation 

[0040] To derive estimators for the linear transformation set forth in step 506 shown in 
Figure 5, the following model is defined for the pixel value of each color channel R, G, 
B that would be sampled at position, (y,x), were that position is the center of an R/G/B 
pixel. As defined herein "y" denotes the row of an image and "x" denotes the column 
of the image. It is assumed that there is only diffuse reflection and no noise and that 
the pixel value generation process is linear (i.e., the camera's gamma equals one). 
Under these assumptions, to a good approximation, any sampled pixel value may be 
specified as the integral over wavelength of the product of the object (e.g., paper plus 
ink or toner) reflectance, the color filter transmission spectrum, the sensor element 
sensitivity spectrum, and the illuminant spectrum. 

[0041] For a pure background point, this integral is factored as a product of a (scalar) 

diffuse illuminant intensity D(y,x) with a color vector with components K , K , K . 

r g b 

Likewise, the pixel value sampled at a pure foreground point may be written as the 

product of D(y,x) with a foreground color vector (F , F , F ). The effects of blur 

r g b 

imply that any sampled value will be a linear combination of the foreground and 
background pixel values. If the blur is the same for each channel at any given point, 
then the linear combination can be described by one scalar function <x (y,x) for all 
channels, which has value one at pure foreground points and zero at pure background 
points. Under these assumptions, the pixel values that would be observed for each 
channel may be written as follows: 



[0042] 



[0043] 



R{y, x) = D(y, x) {ct{y,x)F r + (1 - ct{y y x))K r ) , 
G{yx) = D{y, x) {a(y, x)b\ + (1 - a(y, x))K t ), 
B(y.x) = D(y, x) (a(y,x)F b + (1 - a{y,x))K b ) . 



Using the equations above, G(y,x) is estimated at a point where only R or B is 
known. To use the above equations directly it is necessary to know six constants (i.e., 
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F , F , F , K , K i K ) and two scalar functions (i.e., D(y,x) and <x (y,x)). In 
r g b r g b 

camera images of documents, since lighting variation (i.e., diffuse illuminant intensity 

D(y,x) ) is usually slowly varying, D(y,x) is relatively easy to estimate. However, the 

pure foreground pixel value (i.e., F , F , F , ) is hard to estimate due to the 

r g b 

thinness of character strokes in document images, and the function <x (y,x) is rapidly 
varying. This suggests the elimination of a (y,x) by simultaneously solving for 
example the equations above to estimate G and R, and G and B as follows: 



[0044] 



K — F ' 

G{y, x) = ' ' (R(y, x) - K r D{y, x)) + K D(y, x) t 
K r -h r 

K — F 

G(y, x) = f- (B(y, x) - K b D(y, x)) + K g D(y, x). 

[0045] In one embodiment (hereinafter referred to as "Embodiment A"), these equations 
^ may be rewritten in terms of easier-to-estimate foreground and background constant 

m parameters "a ^ " and "b " (with color channels C, i.e., R, G, B), local samples R(y,x), G 

2 ( y,x ) j ancl B (y» x )> anc * the function D(y,x) as follows: 

u 

Q [0046] 

^ G(y,x)=a r R(y>x) + b t D(y,x) s 

S G(y, x) = a b B(y y x) + b h D{y, x). 



n 



I 5 



[0047] In another embodiment (hereinafter referred to as "Embodiment B"), it is often the 

^ case that the background color D(y,x) is sufficiently slowly varying in a document 

Q 

image that it can be considered constant within a defined window size. In Embodiment 



B, the equations above may be formulated in terms of foreground and background 
constant parameters "a ^ " and "b 1 " (with color channels C, i.e., R, G, B) and local 
samples R(y,x), G(y,x), and B(y,x), as follows: 



[0048] 



G(y,x) = a r R(y,x)+b,\ 
G(y,x) = a A B(y lX )+b b \ 



[0049] Embodiment B corresponds to the flow diagram shown in Figure 5, in which a 
window based approach is used to compute both foreground and background color 
coefficients "a" and "b f ". In one implementation of Embodiment B set forth above, a 
32x32 pixel window that overlaps by 16 pixels on each side is preferred. 
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•40050] Embodiment A corresWids to the flow diagram shown in Figure 1 0, in which step 
£f 504 in Figure 5 is substitutecWith steps 510, 512, 514, 516, and 518. Unlike 

Embodiment B, background coibr is determined at step 510 in Embodiment A. One 
method for determining background color is disclosed in U.S. Patent Application Serial 
No. 09/427,833 entitled "BackgrAind Surface Thresholding", which is incorporated 
herein by reference. As set forth therein, background color is determined by applying 
the disclosed method to each chanrael of a low-pass filter reconstructed image. At 
step 512, connected components ars found, for example, by applying background 
surface thresholding to the luminan< e of the low-pass reconstructed image. At step 
514, the foreground color coefficien s for each binary connected component are 
determined by applying one of the n ethods given below in Section 4 to the region of 
the raw-data corresponding to a dil< tion of the component. At step 516, a clustering 
H procedure is used to group compon ints that are in close spatial proximity and that 

j|: have similar estimated foreground color coefficients. This clustering step enables 

03 more accurate estimation of foreground color coefficients, particularly for small 

L. connected components. 



U 
m 

a 

ri 



[0051] In one embodiment at step 516, a region-growing segmentation is applied 
starting from a Voronoi graph of the connected components. Preferably, 
agglomerative clustering is used at step 516, in which a t-test is performed on the 
color coefficients of all neighboring pairs of components (or sets of previously merged 
components) and the pair that gives the best confidence value for the test is merged. 
This process is repeated until no pair has a sufficiently high confidence value. At step 
518, for each group of connected components, the foreground color coefficients are 
re-estimated. The final coloring of the missing color samples in the CFA image is 
again determined by a pixel-wise linear transformation at step 506 using the 
estimated foreground coefficients and the estimated local background color. 

[0052] 4. Estimating The Coefficients Of The Linear Transformation 

[0053] 

In this section two methods for estimating the coefficients "a" and "b" or "b' "of the 
linear transformations of Embodiments A and B defined above in Section 3. The first 
method, which is defined herein as the "variance method", is preferably used to 
estimate the coefficients for document images that do not contain many rapidly 
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[0054] 
[0055] 



rii 

£ [0056] 

a: 

ii S 

[0057] 
[0058] 



varying colors. The second method, which is defined herein as the "profiles method", 
is preferably used to estimate the coefficients for document images with a range of 
different colors of text. 

4. 1 The Variance Method 

The variance method selects the coefficients to equalize the mean \x and 
2 

variance a of each of the color channels C (e.g., R, G, B) within some window W 
(e.g., 32 x §2 pixels). That is, the variance method assumes that sets of pixels close 

together in some window W have similar first-order statistics (i.e., statistics that 
depend on the sampled value but not the position of the samples of at least two color 
channels within the window W), even though the underlying image may be highly non- 
stationary and non-smooth, as is typical of text. Given this assumption, a histogram 
of two color channels in the window W should have similar appearances. Thus, it 
follows that the coefficients "a" and "b' " of the linear transformations of Embodiment 
B under the variance method can be computed at step 504 in Figure 5 as follows: 



b \ = M G ~ a bM B - 

It will be appreciated by those skilled in the art that the variance method only 
determines the magnitude, but not the sign of the coefficients "a". The overlap of the 
spectral sensitivities of the color filter arrays and the nature of common illuminants 
and ink/toner reflectances makes it unlikely that the coefficients a are negative. 

The variance method would be optimal if all the pixels of an image were 
independently drawn from a normal distribution. However, this is a poor assumption 
for document images. Performance of the variance method degrades when windows 
are too small and contain strong, that is high gray-level gradient, horizontal or 
vertical edges. For example, large errors in estimating the coefficients results if the 
image sampling grid is positioned such that a column of alternating red/green pixels 
falls on the dark side of a strong vertical bar and the neighboring column of 
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alternating green/blue pixels falls on the light side of the bar. 

[0059] It will also be appreciated by those skilled in the art that the variance method can 
be similarly used to compute the coefficients "a" and "b" for Embodiment A by 
normalizing the observed colors with the estimated background color. That is, 
computing: 

[0060] 

R'(y,x) = R( yj x)/D( y> x), 
G\y,x) = G(y,x)/D(y,x) 1 
B'(y,x) = B(y,x)/D(y,x), 

[0061] and using the mean and variance of R\ G\ B' in the four equations at the start of 
this section. 

p [0062] 4.2 The Profiles Method 

01 [0063] The profiles method attempts to exploit the presence of strong edges by 
j~£ modeling the rows or columns of some image window W as level sets (i.e., regions of 

Jr constant pixel value). Given an image of a horizontal bar lying along a red/green row 

fB of an image, the sum of the pixel values of red pixels along the bar should equal the 

™ sum of the green pixels along the bar (plus or minus some corrections for sub-pixel 

H shifts in the ends of the bar). Given a set of such (e.g., green sum, red sum) pairs for 

p various rows or columns of an image window, a line is fit to them to determine the 

~ coefficients. 

[0064] More specifically using the profiles method, the coefficients of the red channel of 
the linear transformation in Embodiment B are computed at step 504 in Figure 5 in 
two steps. At the first step, the sums of red (R) and green (G) pixels along each row 
and column of a window W of a CFA image that contains red pixels is determined. 



[0065] 



More specifically, at the first step given a window W of a CFA image with M h" red 
rows and "w M red columns, these sums are considered as a vector of length h+w 
(hereinafter referred to as the "sums vector"), with all the row sums followed by all the 
column sums. In summations given here, the index is taken to range overall the 
pixels of the type indicated by the summand for the given window. The coordinate 
origin is chosen to be the top left of the window W, which is a red pixel. The red sum 
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and green sum for row k are: 



[0066] 



C 

lG(2*,c). 



[0067] 



Likewise, the red sum and green sum for column k are: 



[0068] 



r 

y h+k ="Z G ( r > 2k )- 



[0069] In the second step, a line is fit to a plot consisting of the red and green row sum 
pairs and column sum pairs in the sums vector by least squares linear regression. The 
results of the linear regression provide estimates for the coefficients of the linear 
transformations in Embodiments A and B. 



example applies the profile method to an image of a letter "a" with baseline at zero 
degrees to sampling grid 650 as shown in Figure 1 1 . The sums for red and green row 
sum pairs and column sum pairs are illustrated in Figures 12 and 1 3, respectively. 
Note that in Figures 12 and 1 3 a cross (e.g., cross 660) signifies a green sum on a red 
or blue line through the sampling grid 650 (e.g., column line 652 and row line 654), a 
circle (e.g., circle 662) signifies a red sum on a red line, and a triangle (e.g., triangle 
664) signifies a blue sum on a blue line (e.g., column line 656 and row line 658). In 
this example, the normalization coefficient determined by regression of line 680 of 
the plot of the green and red sum pairs is unity as shown in Figure 1 4. 

[0071] Note that an analogous procedure is used to estimate the coefficient for the blue 
channel using the blue sums on blue lines computed and illustrated in Figures 1 2 and 
13. It follows that in estimating coefficients for the blue channel, in addition green 
sums on blue lines must be computed. Given these the green and blue sum pairs, a 
regression of a plot of blue sums on blue lines and green sums on blue lines is used 
to compute the blue channel's coefficients. 

[0072] 43 Alternate Embodiments Of The Profiles Method 



1 [0070] 



Figures 11-14 illustrate an example of the application of the profiles method. This 
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[0073] 



%[0074] 



[0075] 



[0076] 



[0077] 



In an alternate embodiment, when one of the points on the line is known, for 
example when it is known that the background color is pure black (i.e., zero values for 
red, green, and blue), or has been normalized to be pure black by employing a 
separate background color estimation phase as in Embodiment A above, constrained 
linear least squares regression procedures may be employed. Appropriate background 
normalization or constrained least squares procedures for such cases will be evident 
to those skilled in the art. In a further alternate embodiment, a confidence measure is 
computed for each row or column sum pair and only pairs of sums with the best 
confidence measures are included in the linear regression. In yet a further alternate 
embodiment, a variance measure, rather than a confidence measure is computed for 
each line sum pair and a weighted linear regression applied to estimate the color 
coefficients using some subset of the pairs of sums. 

The following is an illustration of one combination of these alternate 
embodiments, in which the background color is assumed to be pure black and a 
confidence measure is used to decide which pairs of sums to include in the 
regression. Here the profiles method is adapted to account for rows or columns that 
may be poorly modeled as level sets and are best left out of the linear regression. This 
alternate embodiment of the profiles method accomplishes this by selecting rows or 
columns that have a low total variation relative to their sum. In this alternate 
embodiment, a confidence measure is determined for each row and column, by taking 
the ratio of the green sum to the total variation V of the green pixels in the row or 
column. For a row k, the total variation V is defined as: 

V k =YfiO-Kc)-G(2k,c-2)\ 

c 

The confidence measure, Q, is considered to be a vector of length h+w and 
includes a parameter measuring the strength of noise in the image, a , which is 
weighted by the square root of the number of pixels in the image row or column, n, as 
follows: 



& = 



yfncr + V k 
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[0078] The rows and columns that have the P largest values of Q are selected and fit to a 
line by a least squares linear regression. In this alternate embodiment, if the window 
W of the CFA image corresponds to a character bounding box, then P=4. If "op" 

th 

denotes the index such that Q is the p largest value of Q, then the estimate of 
the coefficient of a is given by: 0 * 3 

[0079] 

p 

^ ^ y op* op 

a =^ 

r P 

X op X op 

[0080] An analogous procedure is used to estimate the coefficient for the blue channel. 

[0081 ] In the alternate embodiment employing a weighted linear regression procedure, 
rather than the confidence procedure, a suitable variance measure for a pair of color 
sums is given by the quantity: 



M [0082] 

n 



na 2 +V k \ 



u 



L [0083] 5. Application Of Estimated Coefficients 

\± [0084] Once the coefficients "a" and "b" or "a" and "b" " in either Embodiment A or B, 
respectively, have been determined, the green channel of the image is obtained by 
applying the linear transformations set forth above. If a full color image is desired, 
missing red (blue) pixels may be determined at a sampled green or blue (red) pixel 
analogously. For a system operating using Embodiment A, the transformation for 
determining missing red pixels given green and blue pixels is as follows: 



[0085] 



R(y.x) = (G(y,x)-b r D{y.*))l<>r. 
R(y.x) = (a t B(y, x) + (A, - b r )D{y, x))la r . 



[0086] For a system operating using Embodiment B, the transformation for determining 
missing red pixels given green and blue pixels is as follows: 



[0087] 



R(y,x) = (aiBiy.xl + bS-b/Va,. 
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[0088] 6. Miscellaneous 

[0089] Parts of the system may be readily implemented in software using software 

development environments that provide portable source code that can be used on a 
variety of hardware platforms. Alternatively, parts of the disclosed system may be 
implemented partially or fully in hardware using standard logic circuits. Whether 
software or hardware is used to implement the system varies depending on the speed 
and efficiency requirements of the system and also the particular function and the 
particular software or hardware systems and the particular microprocessor or 
microcomputer systems being utilized. The system, however, can be readily developed 
by those skilled in the applicable arts without undue experimentation from the 
functional description provided herein together with a general knowledge of the 
computer arts. 

[0090] The invention has been described with reference to a particular embodiment. 

s I 3 

£0 Modifications and alterations will occur to others upon reading and understanding this 



yy 



specification taken together with the drawings. The embodiments are but examples, 
and various alternatives, modifications, variations or improvements may be made by 
those skilled in the art from this teaching which are intended to be encompassed by 
the following claims. 
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